Finding Salient Features for Personal Web Page Categories
نویسندگان
چکیده
We examine techniques that \discover" features in sets of pre{categorized documents, such that similar documents can be found on the World Wide Web. First, we examine techniques which will classify training examples with high accuracy, then explain why this is not necessarily useful. We then describe a method for extracting word clusters from the raw document features. Results show that the clustering technique is successful in discovering word grouops which can be used to nd similar information on the World Wide Web.
منابع مشابه
Enhancing Authentic Web Pages for Language Learners
Second language acquisition research since the 90s has emphasized the importance of supporting awareness of language categories and forms, and input enhancement techniques have been proposed to make target language features more salient for the learner. We present an NLP architecture and webbased implementation providing automatic visual input enhancement for web pages. Learners freely choose t...
متن کاملLocation matters, especially for non-salient features-An eye-tracking study on the effects of web object placement on different types of websites
Users have clear expectations of where web objects are located on a web page. Studies conducted with manipulated, fictitious websites showed that web objects placed according to user expectations are found faster and remembered more easily. Whether this is also true for existing websites has not yet been examined. The present study investigates the relation between location typicality and effic...
متن کاملClustering of Web Pages based on Visual Similarity
Finding the appropriate information on the web is a very tedious job. There is a need to organize the data by classifying the data into categories. This categorization of web pages can be achieved by clustering. The clustering is done by analyzing the content of the HTML page by extracting the keywords. Based on the keywords extracted the page is evaluated and clustered. But the visual feature ...
متن کاملJoint Web-Feature (JFEAT): A Novel Web Page Classification Framework
With the increasing amount of web pages over the internet, it has been a major concern to obtain information on the internet accurately at a reasonable cost with decent performance. A potential solution is through the classification of web pages into meaningful categories. An effective classification of web pages is of benefit to various applications such as web mining and search engines. Unlik...
متن کاملHybrid Adaptive Educational Hypermedia Recommender Accommodating User’s Learning Style and Web Page Features
Personalized recommenders have proved to be of use as a solution to reduce the information overload problem. Especially in Adaptive Hypermedia System, a recommender is the main module that delivers suitable learning objects to learners. Recommenders suffer from the cold-start and the sparsity problems. Furthermore, obtaining learner’s preferences is cumbersome. Most studies have only focused...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computer Networks
دوره 29 شماره
صفحات -
تاریخ انتشار 1997